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Abstract 

We propose a Dynamical Boolean Network (DBN), which is a Virtual Boolean 
Network (VBN) whose set of states is fixed but whose transition matrix can change 
from one discrete time step to another. The transition matrix Tk of our DBN for 
time step k is of the form Q~^TQ, where T is a transition matrix (of a VBN) 
defined at time step k in the course of the construction of our DBN and Q is the 
matrix representation of some randomly chosen permutation P of the states of our 
DBN. For each of several classes of such permutations, we carried out a number of 
simulations of a DBN with two nodes; each of our simulations consisted of 1,000 
trials of 10,000 time steps each. In one of our simulations, only six of the 16 possible 
single-node transition rules for a VBN with two nodes were visited a total of 300,000 
times (over all 1,000 trials). In that simulation, linearity appears to play a significant 
role in that three of those six single-node transition rules are transition rules of a 
Linear Virtual Boolean Network (LVBN); the other three are the negations of the 
first three. We also discuss the notions of a Probabilistic Boolean Network and a 
Hidden Markov Model — in both cases, in the context of using an arbitrary (though 
not necessarily one-to-one) function to label the states of a VBN. 

Key words: Internal Measurement; Boolean Network; Probabilistic Boolean 
Network; Hidden Markov Model; Linearity 



1 Introduction 



One of the simplest and most common ways to categorize an entity as living 
or non-living is to stipulate that if we were to decompose that entity into 
its constituent parts and could then reconstruct the whole by assembling the 
parts, it is non-living; otherwise, it is living. That is, a living thing is more 
than just the collection of its parts, but a non-living thing is not. Then how 
is the gap between living and non-living things expressed? 
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R. Rosen [14,15,16] claims that a central feature of living things is complexity, 
where a system is said to be complex if its behavior cannot be captured by 
models of that system; otherwise, that system is said to be simple. (That is, 
the complexity of a system depends primarily on the models of the system, and 
only secondarily on the system per se.) In Rosen's description, use of the term 
complex in reference to a system is equivalent to use of the term incomputable 
or notwell-formed in reference to its models. 

In his model of living things, Rosen considers a logical paradox (such as Rus- 
sell's paradox [10]) as a metaholism-repairsystem (MRsystem) [14]. An MR sys- 
tem consists of two sets (a set X of rawmaterials and a set Y of behaviors) and 
three functions (a metabolicf unction f E F = }iom{X,Y), a repairfunctiong G 
G = Hom(y, F), and a replicationfunctionh E H = Hom(F, G)), where g and 
h are onto functions and Y H. (For sets A, B, B.om{A, B) denotes the set 
of all morphisms from Aio B [11].) 

One remarkable feature of Rosen's model is the following: 

(i) There do not exist such onto functions g E G and h E H. If they are 
assumed to exist, we obtain a contradiction. 

One way to treat such a contradiction is to invoke hyperset theory [6]. A hy- 
perset is defined as a graphable set — in particular, one that can be represented 
by a hyperset diagram, which is a digraph (directed graph) with the property 
that every node is either a set contained in the hyperset or an element of such 
a set, and every edge is directed from some set S contained in the hyperset 
to one of the elements of S [1,2]. Thus edge a ^ b expresses the relation- 
ship a 3 b, i.e., b is an element of a. For example, the hyperset diagram of 
A = {A, b} is shown in Fig. 1. There are two directed edges: A 3 A and A 3 b. 
A relationship of the form A 3 A leads to the well-known contradiction known 
as Russell's paradox [10]; however, such a relationship is interpreted as just a 
loop structure in hyperset theory. 



A={A,b} b 



Fig. 1. Hyperset diagram of A = {A, b} 



Property (i) can be expressed in hyperset theory by its way. Let us represent 
Hom(y4, B) by the set {{A}, {A, B}}; thus F, G and H are represented by the 
sets {{X}, {X, r}}, {{r}, {F, F}} and {{F}, {F, G}} respectively. Then 
we obtain the hyperset diagram of MR as shown in Fig. 2 [3]. Accordingly, a 
system can be said to be complex if it cannot be well-formed in standard set 
theory but it can be well-formed in hyperset theory. 

Matsuno and Gunji, in their theory of Internal Measurement (IM) [12,13,4,5], 
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Fig. 2. The hypersetdiagram of MR 

propose a different way, called a perpetualequilibratingmechanism. They ad- 
dress property (i) as follows: 

(ii) Any solution to a problem (paradox) is inevitably a pseudo-solution, and 
the pseudo-solution at any given time step "always" triggers a problem 
that that is to be solved at the next time step. This leads to an evo- 
lutionary process that goes on forever and thereby allows for emergent 
properties. Therefore, the paradox is treated as the force that drives the 
time evolution of the system. 

In IM, the gap between the whole and the aggregation of the parts drives the 
time evolution of the dynamical system. Therefore, IM can be understood as 
a proposal of a dynamical system which is based on the idea of a perpetual 
loop between two different logical layers, such as the parts and the whole, with 
a decomposition p of the whole into the parts, and an integration (as a 
pseudo-solution) of the parts into the whole. Thus we may express an IM model 
in terms of a triad: the two different logical layers and a mediator /interface 
to adjust the two layers in an inconsistent manner [13,5]. IM claims that the 
relationship between the two layers (or the relationship between p and p~^) is 
not consistently determined, and for this reason they are perpetually changing 
relative to each another, where the term inconsistent means that we obtain a 
logical paradox if we assume consistency between them. 

Any finite Boolean Network (BN) has a fixed transition matrix, hence a finite 
BN ultimately reaches an attractor and stays there (i.e., the computation 
ends). In section 2, we define a Virtual Boolean Network (VBN) as a BN 
in which the set of incoming nodes for every node is the entire set of nodes 
of the BN, and we show that every finite BN can be treated as a VBN. In 
section 3, we introduce a labeling function to label the states of a VBN, and 
we discuss the Probabilistic Boolean Network (PBN) and the Hidden Markov 
Model (HMM) as extensions of a VBN. The purpose of an HMM is to find 
an optimal solution; in the case of a living thing, however, it is not always 
possible to find such a solution. Thus, in section 4, we abandon the attempt 
to find an optimal solution, and we introduce a Dynamical Boolean Network 
(DBN), which is a VBN whose set of nodes is fixed but whose transition matrix 
can change from one time step to another. Thus even a finite DBN does not 
necessarily have any attractors. In our DBN, a labeling function plays a role in 



3 



maintaining a perpetual loop between the of the VBN at different time steps. 

In section 4, we present a construction of a DBN — a construction that in- 
corporates stochasticity in a number of respects. In section 5, we present the 
results of a number of simulations of a DBN with two nodes. The purpose of 
each simulation was to investigate the set of rule vectors actually visited by 
the DBN, which is equivalent to the set of transition matrices actually vis- 
ited by the DBN. In subsection 5.1, we present the results of a simulation of a 
DBN based on the construction in section 4, which we call a type 1 simulation. 
The simulations presented in the remaining three subsections of section 5 use 
DBN constructions which are similar to the one in section 4, the only differ- 
ence being that, for > 2, we chose the transition matrix Tk of the DBN for 
time step k to be {Qk)~^TQk, where T is the transition matrix in our original 
construction and Qk is the matrix representation of some permutation of 
the states of the DBN. Those three simulations correspond to three different 
classes of permutations. In our type 2 simulation, Pk was randomly chosen 
from the set of permutations that consist of either the identity or a single 
2-cycle or the product of two disjoint 2-cycles. In our type 3 simulation, Pk 
was randomly chosen from the set of all 24 permutations of the four states of 
a DBN with two nodes. In our type 4 simulation, P^ was constructed from a 
labeling function. 

There are 81 rule vectors (out of a total of 256 possible rule vectors for a DBN 
with two nodes) that were not visited at all after the fifth time step of any trial 
in the type 1 and type 2 simulations. However, all 256 rule vectors were visited 
in every trial of the type 3 simulation, as well as in slightly more than half of 
the trials of the type 4 simulation. Indeed, six of the 81 aforementioned rule 
vectors were the ones visited most frequently of all in the type 4 simulation. 
A total of six single-node transition rules are associated with those six rule 
vectors (each rule vector corresponds to an ordered pair of single-node rule 
vectors). Three of those six single- node transition rules are transition rules of 
a linear VBN (LVBN); the other three are the negations of the first three. 
Thus our conclusion is that the type 4 simulation presents new insight into 
linearity of a VBN. 



2 Boolean Networks and Virtual Boolean Networks 

A Boolean Network (BN) [7,8,9] is a digraph in which each vertex has an in- 
ternal state and a transition rule. The vertices of a BN are hereinafter referred 
to as nodes. Each internal state has a value of either or 1 at any given time. 
The transition rule for a given node specifies the next state of that node as a 
function of the current internal states of its incoming (input) nodes. 
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A BN can be finite or infinite. The discussion of BN's in this paper is hmited to 
finite BN's (i.e., those that have finitely many nodes). Some of the assertions 
made herein about BN's do not extend to the infinite case. 
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Fig. 3. Example of a BN 

An example of a BN is shown in Fig. 3. There are three nodes (a, b, c), so the 
BN has a total of 8 (= 2'^) possible states: 

(0,0,0), (0,0,1), (0,1,0), (0,1,1), (1,0,0), (1,0,1), (1,1,0), (1,1,1) 

The set of incoming nodes for each node of a BN is specified by the digraph. 
Here, the sets of incoming nodes for a, b, and c are {c}, {a}, and {a, 6}, 
respectively. Since nodes a and b have just one incoming node apiece, each of 
them has just two possible input states: and 1. Node c has two incoming 
nodes, hence it has four possible input states: (0,0), (0,1), (1,0), (1,1). 

Given the initial state of a BN, we obtain the time evolution of the system 
(the sequence of states visited) by repeated application of the transition rules. 
Since the number of nodes is finite, the number of possible states is finite, 
so for every initial state the system evolves to some finite cyclic sequence of 
states, i.e., a finite sequence of states of the BN which is visited ad infinitum 
from some time on, with no other states intervening. Such a sequence is called 
an attractor. Clearly, each state evolves to one and only one attractor. Thus 
the set of states of a BN can be partitioned in such a way that states in the 
same block of the partition evolve to the same attractor, and states in different 
blocks evolve to different attractors. 



All possible states and attractors of the BN in Fig. 3 are depicted in the 
transition diagram in Fig. 4. In this particular BN, only four of the states are 
visited infinitely often, and there are two attractors: one of length 3 ((0, 0, 0) 
(0, 1, 1) ^ (1, 1, 0)) and one of length 1 ((0, 1, 0)). (The length of an attractor 
is the number of states in the cycle.) The set of states can be partitioned as 
Si U S2, where 

5i = {(0,0,0), (0,0,1), (0,1,1), (1,0,0), (1,0,1), (1,1,0), (1,1,1)} 

and 

^2 = {(0,1,0)}, 
and every state in 5"! evolves to the attractor of length 3. 
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(1.0.1), 
(0,0,1) Kl.!.!)- 



-Xl.0,0)- 



(0.1.1) 



-> (0,0.0) 



(1.1,0) 

Attractor (length = 3) 



(O.l.O)Q 



^"1= {(0,0,0), (0,0,1), (0,1,1), (1,0,0), (1,0,1), (1,1,0) , (1,1,1)} 



Attractor 
(length = 1) 

V ! 

{(0,1,0)} 



Fig. 4. Transition diagram of the BN in Fig. 3 

A digraph with node set V is specified by assigning to every node a set of 
incoming nodes. Assigning a set of incoming nodes to a single node can be 
regarded as selecting an element of Hom(l/, 2), where Hom(yl, B) denotes the 
set of all morphisms from A to B [11]. (Here, 2 denotes the set {0, 1}.) The 
term morphism as used here is synonymous with function; hence Hom(V^, 2) is 
the set of all binary functions (functions whose values are elements of the set 
2 = {0, 1}) with V as domain. Each such binary function can be represented 
as a binary sequence (a sequence of O's and I's) in which the jth term is 
the value of the function at node j. The set of incoming nodes for node i is 
represented by the binary sequence in which the jth term is 1 if node j is an 
incoming node for i, and otherwise. Thus assigning to every node in 1/ a set 
of incoming nodes (i.e., selecting an element of Hom(y, 2) for every node) can 
be regarded as selecting an element of Hom(V, Hom(l/, 2)). 

Alternatively, specification of a digraph can be regarded as selecting an ele- 
ment of Hom(l/ X V,2), since 

Hom(y X y, 2) ~ Hom(y, Rom{V, 2)) (1) 

via the mapping of G Hom(l/ x V, 2) to the element of Hom(V, Hom(y, 2)) 
which is defined by (^(t'l) = (p{vi, — ). For every vi G V, denote ifi^Vi) by 0^^^. 
Then G Hom(y, 2) and, for every V2 G V, 

Note that 

0vi{v2) = 1 < — > V2 is an input node for vi < — > (fi{vi,V2) = 1 



The internal states of the nodes of a BN with node set V can be regarded as 
the components of a state x = {xi,X2, ■ ■ ■ ,x\v\), where \V\ is the number of 
nodes in V and Xi is the internal state of node i (i.e., xi,X2, ■ ■ ■ ,x\v\ G 2). 

There are two ways to specify the transition rules for a BN. The first method 
is completely general. 

Method 1 The transition rule fi for node i assigns the next internal state 
(either or 1) to node i. The assignment is a function of the current internal 
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states of the nodes in the set W{i) of incoming nodes for i. 

LetW{i) = {ii, ^2) ■ • • 5 where \W{i)\ is the number of nodes in W{i). 

Then 

fi G Hom(Hom(VF(z),2),2), 

where 

Hoin(VK(i) ,2) { {xi-^ , > ■ • • > ) ■ "^h ' ■^h ' ■ • • ' '^*|w(i)i ^ ^} 



The transition rules for the nodes of a BN can be regarded as the components 
of a rule vector f : 

f= ifij2,...J\v\) e n Hom(Hom(W0-),2),2) 

jGV 



The second method of specifying the transition rules apphes only to a BN 
for which W{i) = V (hence W{i) = 0) for every node i, where W{i) denotes 
V — W{i). We will refer to such a BN as a Virtual Boolean Network (VBN). 

Method 2 // W{i) = V for every i, then the transition rules fi for the indi- 
vidual nodes can be incorporated into a single transition rule F: 

F e Hom(y,Hom(Hom(y,2),2)), F(z) = G Hom(Hom(y, 2), 2) (2) 



For node i, denote F{i) by Fi. For j G V, we will say that node j is vir- 
tually disconnected from node i if Fi{. . . ,yj, . . .) is invariant with respect to 
substitution of input yj with ~ yj, where 

yj = ^ ~ yj = 1, = 1 ^ ~ yj = 

We will say that the nodes which are not virtually disconnected from node 
i are virtual incoming nodes for node i, and we will denote the set of vir- 
tual incoming nodes by VW{i)] hence the set of nodes which are virtually 
disconnected from node i is VW{i) = V — VW{i). 

Any BN can be regarded as a VBN, by defining Fi in such a way that it 
extends fi and all the nodes in W{i) are virtually disconnected from i. 

Example 1 Let V = {a, b, c}, and fix a node i. Suppose that W{i) = {a} and 
W{i) = {b,c}, and that the transition rule fi for node i is 

f^ 



In (Xa) 


1 


Out (xj) 


1 
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The domain of rule fi is just the set of states of node a, but fi can be embedded 
into rule Fi defined by 



In 




(0,0,0) (0,0,1) 


(0,1,0) (0,1,1) 


Out 




1 1 


1 1 


In 




(1,0,0) (1,0,1) 


(1,1,0) (1,1,1) 


Out 


{Xi) 









The domain of Fi is the set of states of all the nodes. Note that, for all 
Xa,Xb,Xc G {0,1}, the embedding satisfies 

Fi{Xa, Xb, Xc) = Fi{Xa, ~ Xb, = Fi{Xa, Xb, ~ xj = Fi{Xa, ~ Xfo, ~ Xc) 

Hence nodes b and c are virtually disconnected from node i. 

We can translate a specification of a VBN into a specification of a BN, though 
the resulting BN is not necessarily unique (since W{i) can be any superset 
of VW{i)). For node i in Example 1, we could define W{i) as {a, b}, and the 
transition rule Qi for node i as 
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In {Xa,Xb) 

Out (x,) 



(0,0) (0,1) (1,0) (1,1) 











Note that, for all Xa, Xb G {0, 1}, 

fi{Xa) = gi{Xa, Xb) = g{Xa, ~ Xb) 



Furthermore, both and gi embed into the transition rule Fi given in Exam- 
ple 1. 

The transition diagram of a BN is a digraph in which the vertices are the states 
of the BN and every vertex has just one out-going edge. If the set of nodes 
of the BN is V, then the BN has a total of 2'^' states. Thus the transition 
diagram can be represented as a 2l^l X 2l^l adjacency matrix . The adjacency 
matrix T of any digraph is a binary matrix (i.e., a matrix of O's and I's) such 
that Tij = 1 if there is an out-going edge from the vertex labeled i to the 
vertex labeled j (and Tij = otherwise). Since each state of a BN has just one 
out-going edge, exactly one of the entries in each row of the adjacency matrix 
of a BN is a 1. In the case of a BN, the adjacency matrix is also called the 
transition matrix. 
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Proposition 2 The transition matrix T of a BN with vertex set V can he 
obtained from the rule vector f ; however, f cannot be recovered from T. 

Proof (/ ^ T): Label the rows of T with the states of the BN, and use the 
same labels (in the same order) for the columns of T. Let x = (xi, X2, . . . , x\v\) 
be a state of the BN. Then, in the row of T which is labeled with state x, 
the unique 1 is located in the column labeled with state y = {yi,y2, ■ ■ ■ , y\v\), 
where, for every i, yi is computed by applying transition rule fi to x. 
(T 7^ /): To determine transition rule fi, its domain (namely, W{i)) must be 
known; however, W{i) cannot be derived from T. ■ 

The following proposition follows from the fact that, for every node in a VBN 
with node set V, the set of incoming nodes is V. 

Proposition 3 In a VBN, the transition matrix T can be derived from the 
transition rule F , and F can be derived from T . 

Proof By Proposition 2, it suffices to prove that T ^ F. Assume that the 
rows and columns of T are labeled with the states of the VBN, as indicated in 
the proof of Proposition 2, and let i E {1,2, . . . ,\V\}. The output of transition 
rule Fi, which is the next internal state of node i, is a function of the current 
state of the BN, so let x be a state of the VBN. Furthermore, let j be such 
that, in the row of T which is labeled with state x, the unique 1 is located 
in column j, and let y = {yi,y2, . . . , y\v\) be the state that labels column j. 
Then yi is the output of Fi that corresponds to input x. ■ 

From this point on, we assume that every BN is a VBN, so we need not specify 
the set of incoming nodes for any node. Every state x of a VBN with node 
set V is an element of Hom(y, 2), and the transition rule F for such a VBN 
is an element of Hom(V, Hom(Hom(y, 2), 2)). 

Definition 4 A VBN is said to be a linear VBN (LVBN) if, for every node i, 
transition rule Fi is of the form 

Fi{x) = (x ■ Vi) mod 2, 

where x is the current state of the VBN and Vi is a vector whose components 
are 's and 1 's. ( Clearly, the number of components of Vi must be equal to the 
number of nodes in the VBN.) 

Proposition 5 In an LVBN, node i is virtually disconnected from node j if 
and only if the jth component of the vector Vi in Definition 4 is 0. 

Proof The output of transition rule Fi (namely, {x ■ Vi) mod 2) is invariant 
under the transformation that consists of replacing the current internal state Xj 
of node j with ~ xj (and leaving the current internal states of the other nodes 
unchanged) if and only if the jth component of is 0. ■ 
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For a VBN with set of states S and node set V, the number of possible single- 
node transition rules is 2l'^l = 2^'^'. In the case of a VBN with two nodes, this 
is = 16. 

The output of each of the 16 possible single-node transition rules for a VBN 
with two nodes is given in Table 1 as a function of the input state of the VBN; 
tabulated along with the output of each transition rule is the number n of 
virtual incoming nodes for each of the node(s) to which that rule would be 
applied. There are only four possible transition rules for a node of an LVBN 
with two nodes (namely, the rules which are presented in boldface in the 
table). Those four transition rules (1, 7, 11, and 13) correspond to the vectors 
■u = (0, 0), V = (1,1), V = (0, 1), and v = (1, 0), respectively, where v is as in 
Definition 4. If nodes 1 and 2 of an LVBN are a and b (in that order), then 
the sets of virtual incoming nodes for each of the nodes to which rules 1, 7, 
11, and 13 would be applied are 0, {a,b}, {b}, and {a}, respectively. 

Table 1 



Outputs of the transition rules for a VBN with two nodes. 



Input\Rulc no. 


1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


11 


12 


13 


14 


15 


16 


(0,0) 





1 





1 





1 





1 





1 





1 





1 
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(0,1) 
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1 
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1 
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1 
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1 


(1,0) 














1 


1 


1 


1 














1 


1 
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(1,1) 


























1 


1 


1 


1 


1 


1 


1 


1 


n 





2 


2 


1 


2 


1 


2 


2 


2 


2 


1 


2 


1 


2 


2 






3 Labeling Functions 

In section 2, we found that the transition matrix T of a BN has the property 
that exactly one entry in each row is a 1. We will use the term Boolean matrix 
for every square binary matrix that has exactly one "1" in each row. By 
Proposition 3, for every tj, > 1 there is a natural one-to-one correspondence 
between the set of all 2" x 2" Boolean matrices and the set of all VBN's with 
node set {1,2,..., n}, provided that the rows and columns of all the Boolean 
matrices are labeled in the same way (e.g., with the states of such a VBN) 
and in the same order. 

For a VBN with node set V, let S be the set of states, let 

A = {l,2,3...,2l^l}, 
and let S : — s> A be a function that assigns a label to each state. 
For any state sq of the VBN, let 

So, Si, S2, ... 
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be a sequence of states such that T(sj) = Sj+i for every i, where T is the 
transition matrix of the VBN and T(sj) is the unique output state for input 
state Si- Furthermore, let 

ao, ai, a2, ■ ■ ■ , 

be the sequence of labels of the corresponding output states. Since T is a 
transition matrix of a VBN, there is just one such sequence of states with sq 
as first term (hence just one such sequence of labels with ao as first term). 

There are digraphs with underlying set S whose adjacency matrix is not 
Boolean. Thus for some state Sq, there are at least two different sequences 
of states with first term sq (hence there are at least two different sequences of 
labels with first term ao). For the digraph shown in Fig. 5, for example, there 
are two sequences of labels with first term 1: 1,2,2,2,. . . and 1,3,3,3,. . .. The 
adjacency matrix of that digraph is not Boolean, because state 1 can make 
a transition to either state 2 or state 3, so the digraph is not a transition 
diagram of any BN. 




Fig. 5. Digraph that has two sequences of labels with first term 1 

For any given sequence of labels, there exists a unique digraph, which we will 
call the output digraph. For example, the output digraph for the finite sequence 
of labels 

1,2,1,2,2,1,1,1 (3) 

is shown in Fig. 6. The adjacency matrix of that digraph is not Boolean, since 
state 1 can make a transition to either state 1 or state 2 (as can state 2). 
Therefore, the digraph is not a transition diagram of any BN. 

Fig. 6. Output digraph for the sequence of labels in (3) 

There are at least four ways to treat a finite sequence of labels of vertices 
of a digraph, such as the sequence in (3). In each of those four methods, 
the labels are treated as the values of a deterministic or stochastic labeling 
function H whose domain is a subset of the set of states of some VBN or 
Probabilistic Boolean Network (PBN). In a PBN, the transition rules are 
regarded as random variables, and the transition matrix is a probabilistic 
transition matrix [17,18]. 

The first way is to use a VBN and a deterministic labeling function. For 
example, we could use a VBN in which there is a sequence Sq, Si, S2, . . . , Sn-i 
of distinct states such that T(si) = Sj+i for every i < n — 1, where n is the 
number of terms in the given sequence of labels and T is the transition matrix 
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of the VBN. The sequence in (3) has 8 labels, so a VBN with a sequence 
of 8 distinct states will suffice (e.g., the VBN with 3 nodes whose transition 
diagram is shown in Fig. 7). 

(0,0,0) ►(0,0,1) ►-(0,1,0) ^-(0,1,1) (1,0,0) (1,0,1) ►-(1,1,0) ►(1,1,1)'^ 

Fig. 7. Transition diagram of a VBN with a sequence of distinct states that can be 
associated with the sequence of labels in (3) 

Once the VBN with the sequence of n distinct states is constructed, we label 
state Si with the original label Oj, i.e., we define the following many-to-one 
deterministic labeling function H: 

[ (0,0,0) (0,0,1) (0,1,0) (0,1,1) (1,0,0) (1,0,1) (1,1,0) (1,1,1) 
12 12 2 111 



Though we can recover the output digraph from the original sequence of labels, 
the fact that a sequence of distinct states of a VBN was used as the domain 
of the labeling function S is lost, so this way of treating a finite sequence of 
labels of vertices of a digraph is rather artificial. 

The second way is to use a VBN and a stochastic labeling function S. For the 
sequence in (3), we could use a VBN with just a single state, (0), as shown 
in the left half of Fig. 8, and the stochastic labeling function S portrayed in 
the right half of the figure, which assigns the label "1" with some non-zero 
probability r and the label "2" with probability 1 — r. 



(0). 




Fig. 8. Representation of the digrapli in Fig. 6 by the simplest VBN (left) and a 
one-to-many (stochastic) labeling function E (right) 

We can recover any output digraph in this way, that is, by choosing a suitable 
stochastic labeling function S on the set of states of an arbitrary VBN. How- 
ever, just as with the first method, the fact that there is an underlying VBN 
is lost. 

The third way to treat a finite sequence of labels is to use a PBN and a 
deterministic labeling function S. For the sequence of labels in (3), for example, 
we construct the digraph in Fig. 6 and then define a stochastic transition 
matrix T by assigning a weight (probability) to each edge. For example, we 
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could use the matrix 



T 



( 



0.4 0.6 j 



0.2 0.8 



If we use the deterministic function S that labels the first state of the PBN 
with "1" and the second state with "2", we obtain the PBN in Fig. 9. 



Fig. 9. PBN that can be associated with the sequence of labels in (3) 

The fourth way is to use a PBN and a stochastic labehng function. Thus we 
define stochastic matrices T and H. If S is the set of states of the PBN and A 
is the set of labels, then the following hold for every state s: 



where T^^' is the probability that the PBN makes a transition from state s to 
state s' and is the probability that state s is labeled with a. 

Such an entity is an example of what is known as a Hidden Markov Model 
(HMM), a statistical model in which a system is assumed to be characterized 
by a stochastic transition matrix T whose rows and columns are labeled with 
the states of the system and a stochastic matrix H (known as the emission 
matrix) whose rows and columns are labeled with the values of some variable 
a that takes only finitely many different values. Typically, the values of a 
are observable, but the matrices T and S are unknown and the states of the 
system are hidden (not directly observable). 

The following is an iterative process that, given a positive integer n and a 
sequence a^"^ = (ao, . . . , of empirically determined values of a, ap- 

proximates the matrices T and H and finds an appropriate sequence s*-"-* = 
(sO) ■ ■ • 5 Sn-i) of states of the system such that state Sj can be assigned the 
value Qfj. 

Step 1 Define a nonnegative function || ■ || such that, for all pairs (a, /3) of finite 
sequences of values of a with equal numbers of terms, 1 cJ| | is a measure 
of the difference between a and (3. Then set an upper limit e (> 0) on 
the acceptable value of 1 1 ■ 1 1 for such pairs of sequences. 

Step 2 Input a positive integer n and an empirically determined sequence a^"^ = 
(q;o, • • • , «n-i) of values of a, and randomly choose a sequence s*^"^ = 
(so, . . . , Sn-i) of states of the system. 

Step 3 Input the sequences a^"^ and s^"^ to an algorithm p that outputs the most 
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likely pair (S, T) of stochastic matrices for the system: 

p(5W,5-<n))_,(S, T) 

Step 4 Input the sequence a*^"-* and the estimated pair (H, T) to an algorithm a 
that outputs the most likely sequence t^"^ = (to, • • • , tn-i) of states of the 
system: 

cT(a("\ S, T) ^ t>) 

Step 5 Input the sequence t^"-* and the estimated pair (H, T) to an algorithm 7 
that outputs the most likely sequence = (/3o, • • ■ , /?n-i) of values of a: 

7(t^"),H,T) ^ 

^tep 6 Compute the quantity 

If 5 < e, output s^") as the sequence of states that corresponds to the 
sequence a*-"-* of values of a, and then halt. Otherwise, set s^"-* to t^"-* and 
go to Step 3. 

The purpose of HMM is to find an optimal solution with 5 < e. In a living 
thing, it is not always possible to find such a solution. Thus at this point we 
will terminate our discussion of the four methods of treating a finite sequence 
of labels of a digraph which we have presented in this section. In section 4, we 
will introduce a fifth method (namely, a Dynamical Boolean Network (DBN)) 
as an alternative to all four of them, which does not depend on satisfying the 
condition S < e. 



4 Dynamical Boolean Networks 

In this section we construct a Dynamical Boolean Network (DBN), which 
we define as a VBN whose transition matrix can change from one discrete 
time step to another. The number of nodes of the DBN remains constant 
throughout; hence the set of states is invariant. 

Our method of constructing a DBN is completely general. We will apply it to 
an example of a DBN with 2 nodes, so there are 4 states. At each time step, 
we will consider a sequence of 5 states, hence at least one of the 4 states of 
the DBN is repeated in the sequence. 

[Step 1] Let k index the time steps, and initialize k to 1. Fix the number 
of nodes of the DBN, and choose the initial labeling function Si : — ^ A at 
random (where S is the set of states of the DBN and A = {1,2,3,. ..,2^^}). 
Next, randomly choose the initial transition matrix Ti, and a sequence of states 
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of the VBN whose transition matrix is Ti. Then determine the sequence 
of labels di which corresponds to 5*1 via the function Si, and construct the 
output digraph g = (V, E) for di, where V and E are the vertex set and edge 
set, respectively. 

In our example, /i = 2. For our initial labeling function, we will choose the 
many-to-one function 



(0,0) (0,1) (1,0) (1,1) 



Such a labeling function partitions the set of states into groups and assigns a 
different label to each group. For our initial transition matrix, we will choose 



/^O 1 0^ 
10 
1 
10 



which corresponds to the transition diagram in Fig. 10(a), and we will use 

5i = (1,0), (1,1), (0,0), (0,1), (0,1) 



as our sequence of states. 

(1.0)^^(1.1) 



(0.0) 




1 ^ 



Fig. 10. (a) Initial transition diagram of a DBN which corresponds to transition 
matrix Ti and yields the sequence of states Si . (b) Output digraph for the sequence 
of labels ai, without labels, (c) Output digraph for ai, with labels. 

The sequence of labels cJi which corresponds to 5*1 via the function Si is 

di = 1,2,1,2,2 

The unlabeled and labeled versions of the output digraph for di are shown 
in Figs. 10(b) and 10(c), respectively. Clearly, the adjacency matrix of the 
output digraph is not Boolean. 

[Step 2] Use g to construct a set Q of digraphs in which every vertex has just 
one out-going edge (hence ^ is a set of digraphs whose adjacency matrices are 
Boolean). If every vertex of g has just one out-going edge, let Q = {g} and go 
to Step 3. Otherwise, proceed as follows: 
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(i) Select a vertex xi that has out-degree at least 2, let d be the out-degree of 
Xi, let yi, ?/2, . . . , ?/d be the vertices to which xi is connected via an out-going 
edge, and let 0:2, iCa, • • • , be new vertices (not in V). If there is a loop at Xi 
(i.e., if {xi,Xi) G E), then, for the sake of convenience, set yi to Xi. Assign 
entirely new labels (labels that have not yet appeared in the construction 
of our DBN) to the d — 1 new vertices. 

In the output digraph in Fig. 10(c), there is just one vertex with at least 
two out-going edges (namely, the vertex labeled with 2), so Xi is that vertex. 
There are two out-going edges from Xi: one to Xi itself (which is therefore 
also Ui) and one to the vertex labeled with 1 (which is thus 2/2)- Hence d = 2, 
so we add d — 1 = 1 new vertex, X2, which we will label with 2'. 

(ii) For every one-to-one function 



4> ■ {X1,X2, 

construct the new digraph g 
{x2, . . . , Xd} and edge set 



.Xd} {2/1,2/2, • • • ,2/d}, 
(V^, E^) that has vertex set 



V U 



E^ = i^E- {(xi, 2/1), (xi, 2/2), • • • , {xi,yd)}^ 

y {(Xi, 0(Xi)), (X2, (f)ix2)), . . . , (Xd, 0(Xd))} 

The number of digraphs constructed here is d\ . 

In our example, we construct d\ = 2! = 2 new digraphs, gi and g2, one 
for each one-to-one function : {xi, X2} {yi, 2/2}, as shown in Fig. 11. 





Fig. 11. The two digraphs constructed in Step 2(ii) in our example 

(iii) If g has a loop at Xi, construct additional new digraphs: For every i G 
{2, . . . ,d} and every one-to-one function 

(pi : {xi,X2, . . . ,Xd} {Xi,y2, ■ ■ ■ ,yd}, 

construct the new digraph g^. = (y^.,E^.) that has vertex set V^^ = V U 
{x2, . . . , Xd} and edge set 



E<l>^ = [E- {(xi, 2/1), (xi, 2/2), • • • , {xi.yd) 

U {(Xi, 0i(xi)), (X2, 0i(x2)), . . . , (Xd, (piiXd))] 

For every z G {2, . . . , c?}, the number of digraphs (/^. is d!, hence the sum of 
the numbers of digraphs g^^ for alH G {2, 3, . . . , d} is {d — l)-d\. Note that 
every new digraph constructed in Step 2(ii) or Step 2 (iii) has the property 
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that if we identify all the vertices xi,X2, ■ ■ ■ ,Xd, then that digraph collapses 
to g. 

Since the output digraph g in our example has a loop at Xi, we construct 
{d—l)-d\ = (2 — 1) ■ 2! =2 additional new digraphs, g[ and g2, one for each 
one-to-one function 02 : {xi,X2} {x2,y2}, as shown in Fig. 12. 



Fig. 12. The two digraphs constructed in Step 2(iii) in our example 

(iv) If each of the new digraphs just constructed in Step 2(ii) or Step 2(iii) has 
only one out-going edge, let Q be the set that consists of all of those digraphs 
and go to Step 3. Otherwise, go to Step 2(i) and apply this procedure in 
parallel to all of the new digraphs just constructed (by treating each of them 
as g). 

Since Xi is the only vertex in our output digraph g with at least two 
out-going edges, every vertex of each of the four new digraphs constructed 
in Step 2(ii) or Step 2(iii) has just one out-going edge, so we go to Step 3. 

[Step 3] The digraphs in Q all have the same number A of vertices, and their 
adjacency matrices are all Boolean. If A < 2^, let z/ = 2^ — A and, for every 
digraph h ^ Q, construct all digraphs that are formed from h by adding u 
new vertices zi, Z2, ■ ■ ■ , and assigning just one out-going edge to each new 
vertex. Then all of the resulting digraphs have 2'^ vertices, and their adjacency 
matrices are transition matrices of actual VBN's. We will call each of those 
digraphs a pseudo-transition diagram. 

In our example, all of the digraphs in Q are shown in the left-hand column of 
Fig. 13. For each digraph, all four of the pseudo-transition diagrams formed 
from it (by adding one new vertex, Zi, and assigning just one out-going edge 
to Zi) are shown in the columns to the right. 

[Step 4] Choose a pseudo-transition diagram g' from Step 3, and, as indicated 
in what follows, use the function to label the vertices of g' with the states 
of a VBN that has n nodes. Since S^, is not necessarily a one-to-one function, 
it does not necessarily have an inverse, so randomness will be incorporated 
into the labeling process. 

For every / G A, let S^i = {s & S : Sfc(s) = /}. 

(i) Let Z G A. If Ski is nonempty and some vertex v of the output digraph g 
(the digraph g from Step 1, if A; = 1; the digraph g from the most recent 
execution of Step 7, if A; > 1) is labeled with /, let d be the out-degree of v 
in g. 
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Fig. 13. The four digraphs in Q which are constructed from the output digraph 
in Fig. 10 are shown in the far-left column. The corresponding pseudo-transition 
diagrams for each digraph are shown to the right. In Step 4, we have chosen the 
pseudo-transition diagram in the far-right column which is indicated by the solid 
border. 

We show that d < \Ski\'- This is obvious if li = 1, since Ski is nonempty. 
If (i > 2, then at some point in the construction of our DBN, v was chosen 
as Xi in Step 2(i), so v has out-going edges to distinct vertices yi, . . . ,yd of g. 
Those vertices have distinct labels in g, so they correspond to distinct states 
ti,t2, ■ . . ,td in S. Thus there are states si, S2, . . . , in Sm such that the out- 
going edges from v to yi, ... ,yd in g correspond to distinct transitions 

Sl — >■ ti, S2 ^ t2, . . . , Sd ^ td 

of the VBN whose transition matrix is T^. Then the states Si, . . . ,Sd are 
also distinct. (To see this, let i,j be distinct elements of {1,2,..., d}. Then 
U 7^ tj, and the transitions Si U and sj — > tj are distinct, so Si ^ sj, since 
a VBN can make only one transition from a given state.) 

If d = 1, randomly assign some state s in Ski as the label of v, and let 
Dki = {s}. If d > 2, let X2, . . . ,Xd be the d—1 new vertices added in Step 2(i) 
when V was chosen as Xi, and randomly assign states Si, S2, ■ ■ ■ , Sd from Ski 
as the labels of the vertices Xi, . . . , x^; then let Dki = {si, S2, ■ ■ ■ , Sd}. 
(ii) Randomly assign the states in 5" — U«eA Dki as the labels of the remaining 
vertices. 

Let T be the transition matrix of the VBN whose transition diagram was 
constructed by relabeling the vertices of the pseudo-transition diagram g'. 

In our example, the following is what transpires in Step 4: 
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We assign states to the vertices of g' that are labeled with 1, 2, and 2'. 

• For / = 1, 5*1; = {(0, 0), (1, 0)}, and the vertex of g which is labeled with 
/ has out-degree d = 1. We randomly assign (1,0) as the label of vertex 1 
in g\ so Dn = {(l,0)}. 

• For / = 2, Sii = {(0, 1), (1, 1)}, and the vertex of g which is labeled with 
/ has out-degree 2. We randomly assign the states (0, 1) and (1,1) as the 
labels of vertices 2 and 2', respectively, and we have Du = {(0, 1), (1, 1)}. 

• For / = 3 and / = 4, we find that Su = 0, since the image of the labeling 
function Hi is {1,2}. 

We assign a state to the vertex of g' that is labeled with Zi. 



S-[jDu = S-[DnlJD 



12J 



= {(0, 0), (0, 1), (1, 0), (1, 1)} - [{(1,0)} U {(0, 1), (1, 1)}] 
= {(0,0)} 

Hence we assign the state (0,0) as the label of vertex zi in g'. 

Thus we obtain the following one-to-one function E[ : S ^ {1, 2, 2', 2:1}: 



(0,0) (0,1) (1,0) (1,1) 



2' 



Applying the inverse of S'^ to the selected pseudo-transition diagram g' from 
Step 3, we transform g' into the transition diagram of a VBN with 2 nodes, 
as shown in Fig. 14. That is only one of the four possible transition diagrams 
that we could have obtained — there are two ways to select one state in Si as 
the label of vertex 1, and two ways to assign the two states in ^2 as the labels 
of vertices 2 and 2'. 



g' 




Ci.O),^=>-CO.i) 

J 



Fig. 14. Transformation of the chosen pseudo-transition diagram g' into the transi- 
tion diagram (of an actual VBN) which is obtained by applying the inverse of the 
randomly chosen function H'^ to label the vertices of g' with the states (of the VBN) 

The transition matrix of the VBN whose transition diagram is shown in Fig. 14 
is 



T 



1 0^ 
10 
10 
ly 
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[Step 5] Begin the next time step (i.e., increment k by one unit). Then use the 
correspondence between the terms of Sk-2 and the terms of a^^i to construct 
a frequency table as follows (if k = 2, then Sk-2 (= So) does not exist, so 
first randomly choose a sequence of states 5*0 of a VBN that has n nodes): For 
every pair (s, I) ^ S x A, let #(s, /) be the frequency of / as the label of s; 
that is, #(s, I) is the number of terms t of Sk~-2 such that (a) s = t and (b) / 
is the corresponding term of fJfc-i- Enter the value of #(s, I) in the frequency 
table. 

Once the frequency table is constructed, use it to define the labeling func- 
tion Sfc : — > A: Let s G S", and let rUs denote the highest frequency associated 
with s in dk-i, i.e., 

nis = max{#(s, I) : I E A} 

(Note that = if and only if s is not a term of Sk-2-) Randomly choose 
some element of the set {/ G A : #(s, /) = m^} as the value of Sfc(s). (Note 
that if rUs = 0, then {/ G A : #(s, /) = = A.) 

For time step 2 in our example, we randomly choose Sq as 

^0 = (1,0), (0,0), (0,1), (0,0), (0,1) 

Next, we obtain the correspondence between the terms of Sq and the terms 
of cti, which is shown in (4) and gives rise to the frequency table (Table 2). 



^0 



1,0) (0,0) (0,1) (0,0) (o,i; 



Table 2 

Frequency distribution table derived from tlie correspondence 



(4) 



aetween 5*0 and ai. 



Label\State 


(0,0) 


(0,1) 


(1,0) 


(1,1) 


1 





1 


1 





2 


2 


1 








3 














4 















Then we construct S2 from Table 2 as follows: 



s = (0, 0) ^ nis = 2; there is just one / G A (namely, 2) with #(s, I) = nig, 
hence ^2(5) = 2. 

s = (0, 1) ^ ?77,s = 1; there are two elements I of A (namely, 1 and 2) with 
7^(s,/) = nig, hence we randomly assign one of them (2) as the value of 

^2{S). 
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s = (1, 0) ^ = 1; there is just one / G A (namely, 1) with I) = rus, 
hence ^2(5) = 1. 

s = (1, 1) ^ nis = 0, so s is not a term of Sk-2, hence we randomly assign 
an element of A (namely, 2) as the value of ^2(5). 



Thus we obtain 



^2 



(0,0) (0,1) (1,0) (1,1) 



This is only one of eight possible labeling functions that we could have used: 
there are two possibilities for the value of ^2(0,1), and four possibilities for 
the value of ^2(1, 1). 

[Step 6] Let Tk = T. 

In our example, 



To 



10 
10 
10 
1 



[Step 7] Determine the sequence of states Sk for time step k: Let the first 
term of 5*^ be the same as the first term of 5*1, and then apply the transition 
matrix to obtain the remaining terms of Sk- Once 5*^ has been computed, 
determine the sequence of labels dk which corresponds to Sk via the func- 
tion Sfc. Then construct the output digraph g = (V, E) for dk (where E and 
V are the vertex set and edge set, respectively, of g), and go to Step 2. 



In our example, the first term of 5*1 is (1,0). Applying T 
sequence of states 

5^2 = (1,0), (0,1), (1,0), (0,1), (1,0) 
Then, applying S2 to S2, we obtain the sequence of labels 

d2 = 1,2,1,2,1 

and the output digraph for d2', the latter is shown in Fig. 15. 



2, we obtain the 



1 



Fig. 15. Output digraph for the sequence of labels 02 
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5 Simulation 



We performed a number of simulations of a DBN with two nodes. In each 
simulation, we executed 1,000 trials of 10,000 time steps each. The purpose of 
each simulation was to investigate the set of rule vectors actually visited by 
the DBN, which is equivalent to the set of transition matrices actually visited 
by the DBN. 

The number of possible rule vectors for a VBN is 




where S is the set of states and V is the node set. Thus a VBN with two nodes 
has 

(2^')^ = 16^ = 256 

possible rule vectors, so at any given time there are 256 possibilities for the 
rule vector of a DBN with two nodes. 



5.1 Type 1 Simulation 



In this subsection, we present the results of a simulation of a DBN based 
on the construction in section 4, which is hereinafter referred to as a type 1 
simulation. 

The percentage of the 256 possible rule vectors actually visited in each of the 
1,000 trials is shown in Fig. 16. Table 3 shows the corresponding frequency 
distribution (the number of trials as a function of the percentage of rule vectors 
visited). In every trial, the percentage of rule vectors visited was below 70%. 



100 200 300 400 500 600 700 



Fig. 16. Percentage of the 256 rule vectors visited in each of the 1,000 trials of the 
type 1 simulation 
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Table 3 

Frequency distribution of the percentage of rule vectors visited in the type 1 simu- 
lation 



Percentage of rule 


[0,65) 


[65,66) 


[66,67) 


[67,68) 


[68,69) 


[69, 70) 


[70, 100] 


vectors visited, by interval 
















# of trials among 1,000 


1 





20 


429 


493 


57 






The number of time steps at which rule vector (/i, was visited in a typical 
trial of the type 1 simulation with about 68% coverage (i.e., a trial in which 
68% of the 256 rule vectors were visited) is shown in the map in Fig. 17, 
where /i and /2 are the single- node transition rule numbers (from Table 1) 
for nodes 1 and 2, respectively. 




Fig. 17. Number of time steps at which rule vector (/i, /2) was visited in a typical 
trial of the type 1 simulation with about 68% coverage 

The maximum time step at which rule vector (/i, was visited in the type 1 
simulation (where the maximum is taken over all 1,000 trials) is shown in 
the map in Fig. 18. Of the 256 rule vectors, 156 were visited at the very last 
(10,000th) time step of at least one trial; those rule vectors are depicted in 
white in the figure. Red is used for each of the 19 rule vectors that were visited 
at some time step in the range 9,997-9,999 but were never visited at the last 
time step. The remaining 81 rule vectors, which are shown in green, were not 
visited after the fifth time step of any trial. Each of those 81 rule vectors 
was visited in at most 21 of the 1,000 trials, and none of them was visited 
in the 943 trials with less than 69% coverage. Thus we ran two additional 
type 1 simulations: one in which we randomly chose the initial rule vector in 
each trial from the other 175 (= 256 — 81) rule vectors, and one in which 
we eliminated the data for the first five time steps. In those two simulations, 
none of the aforementioned 81 rule vectors was visited at all — in contrast to 
the other 175 rule vectors, each of which was visited in at least 877 of the 
1,000 trials. 
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Fig. 18. Maximum time step at which rule vector (/i,/2) was visited in the first 
type 1 simulation (where the maximum is taken over all 1,000 trials) 



The 81 rule vectors not visited after the fifth time step in the first type 1 
simulation are depicted by green squares in Fig. 19(a). They include rule vector 
(4, 72) for every transition rule /2, and rule vector (/i, 6) for every transition 
rule /i. 



From Table 1, there are four single- node transition rules with exactly one 
virtual incoming node (namely, rules 4, 6, 11, and 13). Note the following 
properties of those four rules: 

(1) Rule 4 is the negation of rule 13, and rule 6 is the negation of rule 11. 

(2) Rules 11 and 13 are rules of an LVBN, but rules 4 and 6 are not. 

Fig. 19(a) is partitioned into sixteen 4x4 blocks, nine of which are colored 
identically to the one shown in Fig. 19(b); the remaining seven blocks are 
totally white. Furthermore, if the block in Fig. 19(b) is turned upside down, 
as shown in Fig. 19(c), the configuration of the nine green squares within 
it is identical to the configuration of the nine partially green blocks within 
Fig. 19(a). That relationship can be seen more readily by comparing Figs. 19(c) 
and 19(d). 

Fig. 20 shows a graph of the cumulative percentage of rule vectors visited in our 
first type 1 simulation, averaged over the 550 trials in which at least 68% of the 
256 rule vectors were visited (i.e., the trials in which at least 175 rule vectors 
were visited), vs. the number of time steps. The blue curve is the result of our 

simulation. Shown in green is the parametric curve ( ^(175,m), ■ 100 1, 

where ^(175, m) is the (theoretical) expectation value of the number of time 
steps taken to visit m rule vectors out of a given set of 175 possible rule 
vectors (the number of rule vectors that were last visited at time step 9,997 — 
or later — in some trial of our simulation) and m varies from 1 to 175: 
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Fig. 19. (a) The 81 rule vectors (indicated by green squares) not visited after the 
fifth time step in any trial of the first type 1 simulation, (b) Configuration of the 
nine green squares in each of the nine partially green 4x4 blocks in (a), (c) Block 
in (b) turned upside down, (d) Configuration of the nine partially green blocks in 
(a), which is identical to the configuration of the nine green squares in (c). 



m 1 yp- 

W7B,m)=g^^j-^ (5) 

TTl 

The quantity ■ 100 is the value of m expressed as a percentage of 256 

(the total number of possible rule vectors for a DBN with two nodes). The 
formula in (5) is obtained by assuming that, at every time step, each of the 

175 possible rule vectors has probability of being visited. The expected 

number of time steps taken to visit 175 rule vectors is ^(175, 175) ~1,005.3. 
However, the number of time steps actually taken to visit an average of 175 
rule vectors (in the 550 trials of our type 1 simulation with at least 68% 
coverage) was about 5,000. 



5.2 Type 2 Simulation 



In our remaining simulations, we used a DBN construction which is similar to 
the one presented in Section 4, the only difference being that, for > 2, we 
chose the transition matrix of the DBN for time step k to be {Qk)~^TQk, 
where T is the transition matrix from Step 4 of our original construction and 
Qk is the matrix representation of some permutation of the 2^ states of a 
DBN with jj, nodes (in the case of our simulations, Pk was some permutation 
of the four states of a DBN with two nodes). We ran three different types 
of such simulations, corresponding to three different ways of choosing the 
permutations. 

In this subsection, we present the results of our type 2 simulation, in which 
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Fig. 20. Blue curve: graph of the cumulative percentage of rule vectors visited 
in the first type 1 simulation, averaged over the 550 trials with at least 68% 
coverage, vs. the number of time steps. Shown in green is the parametric curve 

( 0(175, m), • 100 I , where m varies from 1 to 175 and 0(175, m) is given by (5). 

V 256 / 

Pk (for k >2) was randomly chosen from the following set of permutations: 
{e} U {(1 2), (1 3), (1 4), (2 3), (2 4), (3 4)} 



U{(1 2)(3 4),(1 3)(2 4),(1 4)(2 3)} 

Here, e denotes the identity permutation. Every permutation of the form (a b) 
denotes a transposition (a single 2-cycle), and every permutation of the form 
(a 6)(c d) denotes the product of a disjoint pair of 2-cycles. 

The percentage of the 256 possible rule vectors actually visited in each of the 
1,000 trials of our type 2 simulation is shown in Fig. 21. Table 4 shows the 
corresponding frequency distribution. In every trial, the percentage of rule 
vectors visited was below 71%. 

Table 4 

Frequency distribution of the percentage of rule vectors visited in the type 2 simu- 
lation 



Percentage of rule 
vectors visited, by interval 


[0,67) [67,68) [68,69) [69, 70) [70, 71) 


# of trials among 1,000 


1 33 877 85 4 



The number of time steps at which rule vector (/i, /2) was visited in a typical 
trial of the type 2 simulation with about 68% coverage is shown in the map in 
Fig. 22. There and in each map presented later in this paper, the 81 rule vectors 
from Fig. 19(a) are marked with a star. The associated map of the maximum 
time step at which each rule vector was visited in the type 2 simulation is very 
similar to the map shown in Fig. 18 for the type 1 simulation, so we have not 
presented it here. 
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Fig. 21. Percentage of the 256 rule vectors visited in each of the 1,000 trials of the 
type 2 simulation 
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Fig. 22. Number of time steps at which rule vector (/i, f2) was visited in a typical 
trial of the type 2 simulation with about 68% coverage 

None of the 81 rule vectors from Fig. 19(a) was visited after the sixth time 
step in any trial of our type 2 simulation, and each of those 81 rule vectors was 
visited in at most 14 of the 1,000 trials. Thus we ran two additional type 2 
simulations: one in which we randomly chose the initial rule vector in each 
trial from the other 175 rule vectors, and one in which we eliminated the data 
for the first six time steps. As in type 1, none of those 81 rule vectors was 
visited at all in the two additional simulations — in contrast to the other 175 
rule vectors, each of which was visited in at least 880 of the 1,000 trials. 

Fig. 23 shows a graph of the cumulative percentage of rule vectors visited 
in our first type 2 simulation, averaged over the 966 trials in which at least 
68% of the 256 rule vectors were visited (i.e., the trials in which at least 175 
rule vectors were visited), vs. the number of time steps. Again, the blue curve 
is the result of our simulation, and the green curve is the parametric curve 
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\9{175,m), ■ 100 J. The number of time steps actually taken to visit an 

average of 175 rule vectors (in the 966 trials of our type 2 simulation with 
at least 68% coverage) was about 4,400, which is somewhat smaller than the 
5,000 or so steps taken in our type 1 simulation — and closer to the (theoretical) 
expectation value of about 1,005.3. This difference in the results of the two 
simulations could possibly be due to the use of randomness in selecting the 
permutation that figures in the computation of the transition matrix 
(for time step k) in the type 2 simulation. 
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Fig. 23. Blue curve: graph of the cumulative percentage of rule vectors visited 
in the first type 2 simulation, averaged over the 966 trials with at least 68% 
coverage, vs. the number of time steps. Shown in green is the parametric curve 



6(175, m) 



m 
256 



100 ) , where m varies from 1 to 175 and 0(175, m) is given by (5). 



5.3 Type 3 Simulation 



In this subsection, we present the results of our type 3 simulation, in which 
Pk (for k > 2) was randomly chosen from the set of all 24 permutations of the 
four states of a DBN with two nodes. 

Each of the 256 rule vectors was visited in all 1,000 trials of our type 3 simula- 
tion, and the 81 rule vectors from Fig. 19(a) were visited much more frequently 
in the type 3 simulation than in the type 1 and type 2 simulations. In fact, 
the total number of visits per rule vector (summed over all 1,000 trials of the 
type 3 simulation) was just under two-thirds as large, on average, for those 81 
rule vectors as for the other 175. 

The number of time steps at which rule vector (/i, /2) was visited in a typical 
trial of the type 3 simulation is shown in the map in Fig. 24. 

The maximum time step at which rule vector (/i, /2) was visited in the type 3 
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Fig. 24. Number of time steps at which rule vector (/i,/2) was visited in a typical 
trial of the type 3 simulation (with 100% coverage) 



simulation (where the maximum is taken over all 1,000 trials) is shown in the 
map in Fig. 25. All but 8 of the 256 rule vectors were visited at the very last 
(10,000th) time step of at least one trial; those 248 rule vectors are depicted 
in white in the figure. Red is used for the other 8 rule vectors, each of which 
was visited at the 9,998th or 9,999th time step of at least one trial. 
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Fig. 25. Maximum time step at which rule vector (/i, /2) was visited in the type 3 
simulation (where the maximum is taken over all 1,000 trials) 



Fig. 26 shows a graph of the cumulative percentage of rule vectors visited in 
our type 3 simulation, averaged over all 1,000 trials, vs. the number of time 
steps. The blue curve is the result of our simulation, and the green curve is the 

parametric curve I 6'(256,m), ■ 100 1, where 6'(256,m) is the (theoretical) 

expectation value of the number of time steps taken to visit m rule vectors 
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(out of the set of all 256 possible rule vectors) and m varies from 1 to 256: 



6/(256, m) = ^ 



256 



^ 256 



(6) 



The formula in (6) is obtained by assuming that, at every time step, each of the 
256 possible rule vectors has probability of being visited. The expected 

number of time steps taken to visit 256 rule vectors is ^(256, 256) ~1567.8, 
and the number of time steps actually taken to visit all 256 rule vectors (in 
all 1,000 trials of our type 3 simulation) was about 1,650. 
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Fig. 26. Blue curve: graph of the cumulative percentage of rule vectors visited in 
the type 3 simulation, averaged over all 1,000 trials, vs. the number of time steps. 



m 



Shown in green is the parametric curve I 0(256, m), • 100 I , where m varies from 

V 256 / 

1 to 256 and 0(256, m) is given by (6). 

We also ran a simulation in which was randomly chosen from the comple- 
ment of the set of permutations used in type 2: 

{(1 2 3), (1 2 4), (1 3 2), (1 3 4), (1 4 2), (1 4 3), (2 3 4), (2 4 3)} 

|J{(1 2 3 4), (1 2 4 3), (1 3 2 4), (1 3 4 2), (1 4 2 3), (1 4 3 2)} 

That is, Pfc was chosen to be either a 3-cycle or a 4-cycle. The results were 
similar to those for our type 3 simulation, so we have not presented them here. 



5.4 Type 4 Simulation 



In this subsection, we present the results of our type 4 simulation, in which per- 
mutation Pfc (for k >2) was constructed from the labeling function 3^:3^ A, 
as follows: 

(i) For every / G A, let Ski = {s e S : Sfc(s) = 1} and rki = \Ski\. 
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i) For every / with rki > 1, randomly choose a permutation pki of the states 
in Ski such that pki is an r^z-cycle. Then let Pk consist of the product of all 
the cycles thus chosen: 

Pk= n 

I e A 

Tkl > 1 

In our example, the following is what transpires at time step 2 in construct- 
ing P2 from the labeling function 



(0,0) (0,1) (1,0) (1,1) 



• For / = 1, S21 = {(1,0)}. Thus r2i = 1, so no permutation is chosen for 
/ = 1. 

• For / = 2, = {(0,0), (0,1), (1,1)}. Thus r2i = 3, so we randomly 
choose a 3-cycle P21 of the three states in 821- In the lexicographical order- 
ing scheme, they are states 1, 2, and 4, so there are two such 3-cycles to 
choose from: (124) and (142). We choose (142) a.s p2i- 

• For / = 3 and / = 4, 5*2; = 0, so no permutation is chosen for either of 
them. 

(ii) We obtain the permutation P2 = (14 2). The matrix representation of P2 is 

^0 1^ 
10 

Q2 = 

10 

^0 1 Oy 

Thus we obtain the transition matrix 



T2 = {Q2r'TQ2 = 



10 0^ 




^1 0^ 




^0 1^ 




^0 10^ 


1 




10 




10 




10 


10 




10 




10 




10 


1 Oy 




^0 ly 




^0 1 Oy 




^0 ly 



The percentage of the 256 possible rule vectors actually visited in each of the 
1,000 trials of our type 4 simulation is shown in Fig. 27. Table 5 shows the 
corresponding frequency distribution. In 906 of the 1,000 trials, at least 90% 
of the rule vectors were visited; also, every rule vector was visited in at least 
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775 trials. Furthermore, the total number of visits per rule vector (summed 
over all 1,000 trials of the type 4 simulation) was nearly twice as large, on 
average, for the 81 rule vectors from Fig. 19(a) than for the other 175 rule 
vectors. 




'^0 100 200 300 400 500 600 700 800 900 1000 



Fig. 27. Percentage of the 256 rule vectors visited in each of the 1,000 trials of the 
type 4 simulation 

Table 5 

Frequency distribution of the percentage of rule vectors visited in the type 4 simu- 
lation 



Percentage of rule 
vectors visited, by interval 


[0,50) 


[50,60) 


[60,70) 


[70,80) 


[80,90) 


[90,100) 100 


# of trials among 1,000 


26 


5 


16 


20 


27 


356 550 



The 1,000 trials of the type 4 simulation can be grouped into two classes: 

Class (i) trials in which a handful of rule vectors were visited very frequently (visited 
at roughly 1,000 or more different time steps) and the remaining rule vectors 
were visited very infrequently (visited at roughly no more than 30 different 
time steps) 

Class (ii) trials in which the variation in the number of visits per rule vector was very 
small 

About 70% of the trials of the type 4 simulation fall into class (i); the remaining 
30% are in class (ii). All 94 trials with less than 90% coverages are in class (i). 

The numbers of time steps at which rule vector (/i,/2) was visited in two 
specific trials of the type 4 simulation with at least 90% coverage apiece are 
shown in the maps in Fig. 28. The map in (a) is for a typical class (i) trial, 
and the map in (b) is for a typical class (ii) trial. 

Only six rule vectors (namely, (4,11), (6,7), (6,10), (7,4), (10, 4), and (13, 6)) 
were visited very frequently (visited at roughly 1,000 or more different time 
steps) in the class (i) trial associated with Fig. 28(a). White is used for the 
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remaining 250 rule vectors, each of which was visited no more than 22 times. 
In about 20% of the class (i) trials, at least one rule vector other than those six 
was among the rule vectors visited very frequently (visited at roughly 1,000 
or more different time steps). 

The map shown in Fig. 28(b), for a class (ii) trial, bears a much greater 
resemblance to the maps for the trials of the type 3 simulation than to those 
for the class (i) trials of the type 4 simulation. 



(a) (b) 
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Fig. 28. Number of time steps at which rule vector (/i, /2) was visited in two trials 
of the type 4 simulation with at least 90% coverage apiece: (a) a typical class (i) 
trial and (b) a typical class (ii) trial 

All six of the rule vectors enumerated earlier are from Fig. 19(a). Each of 
those six rule vectors was visited a total of more than 300,000 times in the 
class (i) trials of the type 4 simulation, while each of the 175 rule vectors not 
from Fig. 19(a) had a total of fewer than 50,000 visits in the class (i) trials. 

The total number of visits (in units of 10^) of rule vector (/i, /2) in the class 
(i) trials of the type 4 simulation (summed over all 1,000 trials) is shown in 
the map in Fig. 29(a). 

Only 54 of the 256 rule vectors were visited very frequently in at least one of 
the class (i) trials: 41 of the 81 rule vectors from Fig. 19(a), including the six 
rule vectors enumerated earlier; and 13 of the other 175 rule vectors. Those 
54 rule vectors are depicted by red squares in Fig. 29(b). 

The maximum time step at which rule vector (/i, /2) was visited in the type 4 
simulation (where the maximum is taken over all 1,000 trials) is shown in 
the map in Fig. 30. Of the 256 rule vectors, 203 were visited at the very last 
(10,000th) time step of at least one trial; those rule vectors are depicted in 
white in the figure. Red is used for the other 53 rule vectors, each of which 
was visited at some time step in the range 9,992-9,999 but was never visited 
at the last time step. 
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Fig. 29. (a) Total number of visits (in units of 10^) of rule vector (/i,/2) in the 
class (i) trials of the type 4 simulation, (b) The 54 rule vectors (indicated by red 
squares) that were visited very frequently in at least one of the class (i) trials. 




Fig. 30. Maximum time step at which rule vector (/i, 72) was visited in the type 4 
simulation (where the maximum is taken over all 1,000 trials) 



Fig. 31 shows a graph of the cumulative percentage of rule vectors visited 
in our type 4 simulation, averaged over the 550 trials in which all 256 rule 
vectors were visited, vs. the number of time steps. Again, the blue curve is the 
result of our simulation, and the green curve is the parametric curve given by 

6'(256,m), ■ 100 J. The number of time steps actually taken to visit all 

256 rule vectors (in the 550 trials of our simulation with 100% coverage) was 
about 4,500, which is about 2.9 times as large as the theoretical value and 2.7 
times the number of steps taken in the type 3 simulation. 
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Fig. 31. Blue curve: graph of the cumulative percentage of rule vectors visited in the 
type 4 simulation, averaged over the 550 trials with 100% coverage, vs. the number 

0(256, m), • 100 ) , where 

256 / 

m varies from 1 to 256 and 9(256, m) is given by (6). 
6 Concluding Remarks 



At each time step in our simulations of a DBN, we incorporated randomness 
into several processes: in generating pseudo-transition diagrams from the out- 
put digraph, in choosing a pseudo-transition diagram and then transforming 
it to the transition diagram of a VBN, in choosing a labeling function for the 
states of the DBN, and (in types 2, 3, and 4) in choosing a permutation of the 
states of the DBN. 

In every trial of the type 1 simulation, fewer than 70% of the rule vectors 
were visited. Furthermore, the 256 rule vectors split into two groups: the set 
of 81 rule vectors that were not visited after the fifth time step in any trial 
of the first type 1 simulation (and not visited at all in the two additional 
type 1 simulations), and the set consisting of the other 175 rule vectors, each 
of which was visited considerably more frequently than those in the set of 
81. Those features of the type 1 simulation were largely preserved under the 
kinds of permutations of the states of the DBN which were chosen at random 
in the type 2 simulation (namely, the identity permutation, the 2-cycles, and 
the products of disjoint pairs of 2-cycles). 

The salient features of the type 1 and type 2 simulations were not preserved 
under the kinds of permutations chosen in the type 3 and type 4 simulations. 
(Any permutation could be selected in type 3; however, the permutation Pk 
selected at time step k in the type 4 simulation depended in part on the 
labeling function S^.) A remarkable contrast was found between the results 
of the type 3 and type 4 simulations on one hand, and those for type 1 and 
type 2 on the other. All 256 rule vectors were visited in every trial of the type 3 
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simulation, and six of the 81 rule vectors which were not visited after the sixth 
time step of any trial in the type 1 and type 2 simulations were the rule vectors 
visited the most frequently of all — by far — in the type 4 simulation. 

The results of the one simulation we ran in which was randomly chosen 
from the complement of the set of permutations used in type 2 were similar 
to those for our type 3 simulation, in that the total number of visits per rule 
vector varied little from one rule vector to another. 

Fully 41 of the aforementioned 81 rule vectors were visited very frequently 
(visited at roughly 1,000 or more different time steps) in at least one class (i) 
trial of our type 4 simulation. Six of those 41 rule vectors had a total of over 
300,000 visits in the type 4 simulation. 

Each component of every one of those six rule vectors is a single-node tran- 
sition rule from the set {4,6,7,10,11,13}. Three of those (rule numbers 7, 
11, and 13) are transition rules of an LVBN; each of the other three (4, 6, 
and 10) is the negation of a single-node transition rule of an LVBN. The only 
single-node transition rules of an LVBN which are not a component of any of 
those six rule vectors are rule numbers 1 and 16 — and both of those transition 
rules correspond to a node of an LVBN that has no virtual incoming nodes. 
Thus it appears that we can regard the class (i) trials of the type 4 simulation 
as a restriction of the single-node transition rules of a VBN to the single-node 
transition rules of an LVBN and their negations, hence that linearity may have 
played a significant role in our type 4 simulation. Therefore, linearity may be 
a worthwhile topic for further study. 
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